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Abstract — The min-sum (MS) algorithm is arguably the second 
most fundamental algorithm in the realm of message passing due 
to its optimality (for a tree code) with respect to the block error 
probability [1]. There also seems to be a fundamental relationship 
of MS decoding with the linear programming decoder [2]. Despite 
its importance, its fundamental properties have not nearly been 
studied as well as those of the sum-product (also known as BP) 
algorithm. 

We address two questions related to the MS rule. First, we 
characterize the stability condition under MS decoding. It turns 
out to be essentially the same condition as under BP decoding. 
Second, we perform a degree distribution optimization. Contrary 
to the case of BP decoding, under MS decoding the thresholds 
of the best degree distributions for standard irregular LDPC 
ensembles are significantly bounded away from the Shannon 
threshold. More precisely, on the AWGN channel, for the best 
codes that we find, the gap to capacity is ldB for a rate 0.3 
code and it is 0.4dB when the rate is 0.9 (the gap decreases 
monotonically as we increase the rate). 

We also used the optimization procedure to design codes for 
modified MS algorithm where the output of the check node is 
scaled by a constant 1/ot. For a= 1.25, we observed that the 
gap to capacity was lesser for the modified MS algorithm when 
compared with the MS algorithm. However, it was still quite 
large, varying from 0.75 dB to 0.2 dB for rates between 0.3 and 
0.9. 

We conclude by posing what we consider to be the most 
important open questions related to the MS algorithm. 

I. Introduction 

The min-sum (MS) decoder is perhaps the second most 
fundamental message passing decoder after Belief Propagation 
(BP) decoder for two main reasons. Firstly, the MS decoder is 
optimal with respect to block error probability on a tree code 
[1]. Secondly, it is widely believed that the MS decoder is 
closely related to the linear programming (LP) based decoder 
proposed in [12]. In [2], a complete characterization of the 
decoding region of the LP decoder has been provided with 
respect to the pseudocodewords of the underlying bipartite 
graph. The results in [13] suggest that the decoding region of 
the LP decoder is identical to that of the MS decoder (indeed, 
this is the case for tree codes). In addition, the MS decoder 
is of practical interest because of its low implementation 
complexity. 

In [11], the asymptotic performance of the MS decoder 
using density evolution was evaluated. Not much is known, 



however, analytically about the density evolution behavior of 
the MS decoder as compared to BP. 

We first address the issue of stability of the MS decoder. 
In particular, we derive a condition which guarantees that the 
densities corresponding to the MS decoder which one observes 
in density evolution converge to an "error-free" density. This 
condition turns out to be essentially the same as the stability 
condition for BP. 

Recall that for the BP decoder the space of densities which 
arise in the context of density evolution is the space of 
symmetric densities. Under MS decoding, on the contrary, no 
equivalent condition is known. Empirically, one observes that 
for y > the densities fulfill the inequality 

a(y)e~ y < a(-y) < a(y). 

We show that such a bound indeed stays preserved under MS 
processing at the check nodes. The equivalent question at the 
variable nodes is an open question. 

What are the fundamental performance limits under MS 
decoding? Under BP decoding an explicit optimization of the 
degree distribution shows that we can seemingly get arbitrarily 
close to capacity by a proper choice of the degree distribution. 
Is the same behavior true under MS decoding or are the 
fundamental limits which can not be surpassed? In order to 
address this question we implemented an optimization tool 
based on EXIT charts. We found that the gap between the 
best code and Shannon limit is rather large. 

In [7] some simple improvements are proposed to the 
MS decoder. For some examples, it is demonstrated that 
by a simple scaling of the output at the check nodes, the 
performance of the MS decoder can be brought closer to that 
of the BP decoder. Using the LDPC code design procedure, 
we also study how close we can get to the Shannon capacity 
limit by using this modified MS algorithm. 

The paper is organized as follows. In Section [II] we give 
relevant definitions and briefly review the MS decoding al- 
gorithm and its density evolution analysis. In Section [TTTJ 
we derive a sufficient condition for stability and also discuss 
some properties of density which arise in density evolution. 
In Section IIV1 we discuss the optimization procedure. We 
then present the optimization results in Section [V] and finally 
conclude in Section [yT] 



II. Definitions and Preliminaries 

The LDPC ensemble is specified by specifying X(x) = 
Y,^i x ' 1 an d p( x ) = HPi x ' 1 which represent the degree dis- 
tribution (dd) of the bit nodes and check nodes in the edge 
perspective, i.e., X, (p,) is the fraction of edges connected 
to a degree i bit (check) node. The design rate of an LDPC 
ensemble is given by 1 — Ly/Ly- 

We consider transmission over a binary-input, memoryless, 
and symmetric (BMS) channel. Let L c h. u be the log-likelihood 
ratio (LLR) of bit u obtained from the channel observation 
corresponding to bit u. Let L *l and L^ u be the check 
to bit and bit to check message at iteration t corresponding 
to edge (u,v). We will sometimes specifically refer to the 
binary input AWGN (biAWGN) channel, Y = (1 - 2X) +N, 
where X e {0,1} is the input bit and N has a Gaussian 
distribution with mean and variance a 2 . In this case L t /, 
is given by 2Y / a 2 and its distribution under the all zero code 
word assumption is Gaussian with mean 2 /a 2 and variance 
4/o 2 . Finally, we denote the Bhattacharyya constant associated 
to density a by B (a) = J" 00 a(x)e _ ic/x and error probability by 
P e (a) = J oo a(x)dx+ j J °- a(x)dx. 

We now discuss the message passing rules for the MS 
decoder. In MS decoder the bit to check message update is 
given by 
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where £ is the set of edges. The check to bit message update 
equation is 
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For the MS decoder a = 1, but we will also consider modified 
MS decoders with a > 1 . 

The asymptotic performance of LDPC codes under MS 
decoding can be characterized by studying the evolution of the 
density of the messages with iterations (see [9]). Let a c /,(Z), 
b t (l), and a, (7) be the probability density function (pdf) of 
channel log-likelihood ratio, the message from check to bit 
and bit to check node respectively in f th iteration under the all 
zero codeword assumption. 

The density evolution equation for the bit node (correspond- 
ing to (Q]i) is given by 



a f (/) = a e *(/)®£^(b ( _i(/)) 
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where a®' denotes convolution of a with itself / times. Sim- 
ilarly the check node side operation on densities is denoted 
by KL The pdf of the message at the output of check nodes 
employing MS (corresponding to has been derived in [7], 
[11]. It is given by 
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+ (a,(Z)- a, (-/))( / (a,(je)-a,(-je))<fc 
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The density evolution process is started with bo(Z) = 5o(/) and 
iterative decoding is successful if the densities eventually tend 
to &»(/). 

III. Stability Condition and Some Properties of 
the Densities 

In this section we derive the stability condition under MS 
decoding. The stability condition guarantees that if the density 
in density evolution reaches "close" to error free density 
(8oo(/)) then it converges to it. We derive the stability condition 
by upper bounding the evolution of the Bhattacharyya parame- 
ter in density evolution. Note that the Bhattacharyya parameter 
appears naturally in the context of BP where densities are 
symmetric. In this case the Bhattacharyya parameter has a 
very concrete meaning: it is equal to — lim„^oo ^ log (P e (a®")), 
where a is a symmetric density. For general densities which are 
not symmetric this is no longer true but we can always com- 
pute B (a) = J" ao a(x)e^^dx. The reason we use Bhattacharyya 
parameter is to have a one dimensional representation of 
densities and because of its property of being multiplicative 
on the variable node side. 

In the following lemma we give a sufficient condition for 
stability of 8<»(/). This condition turns out to be same as the 
stability condition for BP (Theorem 5, [10]). 

Lemma 1: Assume we are given a degree distribution 
pair (A-,p) and that transmission takes place over a BMS 
channel characterized by its L-density a t /,. Define ao = 
a c h, and for t > 1, define a, = a c /, ©A.(p(a,_i)) = a c h © 
aOt-iA®^'- 1 ) 
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(4) 



then there exists a strictly positive constant £, = ^(A,,p,a c /j) 
such that if, for some f£N,B(a,)<^ then B (a t+ „) as well as 
P e (a r+ „) converge to zero as n tends to infinity. Conversely, if 
B (a c h) A,'(0)p'(l) > 1 then liminff^Pe (a f ) > with a = a ch . 
Proof: By Lemma [3] in Appendix we know that 
< (k-\)B{z t ). Thus 



B a," 



B(a t+1 )=B(a ch )X [B J>(a t f ^ 



<B(a ch )X(p'(l)B(a t )). 
Expanding the last equation around zero, we get 

,2 



B(a ch )X'(0)p'(l)B(a t )+O(B{a t 



Since B(a c /,)A,'(0)p'(l) is assumed to be a constant less than 
1, we can choose a sufficiently small £, = £, (A,, p, a t /,) such that 
if S(a f ) thenB(a cA )A/(O)p'(l) + 0(a f ) <e<l. Therefore 
if for some t G N, B(a t ) < %, then B{a t+n ) < t n B{a t ), which 



converges to zero as n tends to infinity. As 

P e (a r+ „) = J a (x)dx + - a (x)dx 

< / a(jc)e J dx < B(a t+ „) , 

so P e (a f +„) also converges to zero. 

For the converse statement, the stability condition in Eqn© 
is a necessary condition for BP decoding to be successful. 
Hence by the optimality of BP decoding on a tree it is also a 
necessary condition for MS decoding to be successful. ■ 

In proving the sufficiency of the stability condition we 
used the Bhattacharyya parameter as the functional to project 
densities to one dimension. However we could have used 
any other functional of the form B a (a) = E [e _aX ] ,a > 
which is multiplicative on the variable node side. Lemma 
|3] stays valid for any such functional. Therefore, we get a 
general stability condition that reads B a (a c /,) A/(0)p'(l) < 1. 
However, as a c /,(x) is a symmetric density, B a (a c i,) >B(a c h). 
This implies that the sufficient condition for a ^ j is weaker 
than the condition corresponding to Bhattacharyya parameter. 

Note that the converse in Lemma [T] is partial. It does not 
say that the condition in Eqn(0]i is necessary for the density 
to converge to 8c*,(/) if for some t the density a, is "close" 
to 8oo(/). However the following observation suggests that this 
indeed should be the necessary condition. Suppose we evolve 
the density 2e8o(7) + (1 - 2e)Soo(/) under the MS decoder. 
Then it again follows by the arguments of Theorem 5 in 
[10]) that for the density to converge to 8oo(/) the necessary 
condition is B (a c ./,) A/(0)p'(l) < 1. For the BP decoder we 
know that 2e8o(/) + (1 -2e)8oo(/) is the "best" density (in the 
sense of degradation) with error probability e. However for the 
MS decoder this is not the case. Hence we can not conclude 
that Eqn© is a necessary condition. 

The BP densities satisfy the symmetry condition a(x) = 
a(— x)e x . The densities which arise in MS decoder do not 
satisfy the symmetry property. However, we have observed 
empirically that the densities satisfy the property that a(x) > 
a(— x) and a(x) < a{—x)e x , x > 0. In the following lemma we 
prove that these properties remain preserved on the check node 
side. 

Lemma 2: Let a(x) and b(x) be two densities which satisfy 
the property that a(x) > a(— x),b(x) > b(— x) and a(x) < 
e x a(-x),b(x) < e x b(-x) for Vx > 0. Let c(x) =(aB b)(x). 
Then c(x) > c(— x) and c(x) < e*c(— x). 

Proof: Let A and B be random variables having density 
a and b respectively. Then 

c(x) = a(x)P(B > |x|) + b(x)P(A > |x|) + 

a(-x)P(B< -|x|) + b(-x)P(A < -|x|). 

Thus 

c(x) - c(-x) = (a(x) — a(— x)) (P (B > x) - P{B < -x)) + 
(b(*)-b(-x))(P(A>x)-P(A<-*)), 
>0. 



Similarly, 

c(-jc) - e~ x c(x) = (a(-x) - e~ x z{x)) P(B > x) + 

(b(-x) - e-*b(x)) P(A > x) + (a(x) - e - x a{-x)) P(B< -x) 

+ (b(x) - e~ A b(-x)) P (A < -x) , 

which is greater than or equal to zero by the assumption. ■ 
Proving Lemma [2] for the variable node side is still an open 
question. 

IV. Optimization Procedure 
A. EXIT Charts 

EXIT charts [8] were proposed as a low complexity al- 
ternative to design and analyze LDPC codes. Typically by 
assuming that the density of the messages exchanged during 
iterative decoding is Gaussian, the problem of code design 
can be reduced to a curve fitting problem which can be done 
using linear programming. If the Gaussian assumption is exact, 
this technique is shown to be optimal in [6]. In [3], a fast 
procedure is proposed that uses a combination of EXIT charts 
and density evolution to design LDPC codes. The basic idea 
is to perform the design in steps, where, in each step, the 
LDPC code ensemble is optimized using EXIT charts using 
the densities of the messages obtained from density evolution 
of the ensemble obtained in the previous step. In this paper, 
we use a similar idea to design LDPC codes for MS decoding. 

An EXIT curve of a component decoder is a plot of 
the mutual information corresponding to the extrinsic output 
expressed as a function of the mutual information correspond- 
ing to the a priori input (message coming from the other 
component decoder). Usually, it is assumed that the a priori 
information is from an AWGN channel of signal-to-noise ratio 
1 /a 2 and the EXIT curve is obtained by calculating the input 
and output mutual information for a 2 varying from to °°. 
In an EXIT chart, the EXIT curves of one component code 
and the flipped EXIT curve of the other component code are 
plotted. Using this chart, we can predict the path taken by the 
iterative decoder as shown in Fig. Q] It has been observed that 
the actual path taken and the path predicted from EXIT charts 
are quite close. Based on this observation, LDPC codes can 
be designed as follows. 

Let Ib(lA,i) (I c (lA,i)) be mutual information corresponding 
to the extrinsic output of bit (check) node of degree i when 
the a priori mutual information is I a- The mutual information / 
can be calculated from the conditional distribution /(/) using 

The EXIT curve of the bit nodes and the check nodes is 
given by I b = £Vi.(4,0 and I c = Lp,7 c (/a,0 respectively. 
Usually both 4 and I c are increasing function of I a- The con- 
vergence condition, based on the assumption on the message 
density, states that the EXIT curve of the bit nodes should 
lie above that of the check nodes for the iterative decoder 
to converge to the correct codeword, i.e., 4(//t) > I c ~ (/a) or 
equivalently I^ X {Ia) <Ic(h) for all I A where I c (I~ l (I A )) =h- 
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Fig. 1: EXIT curves of the two component codes corresponding to the 
(3,6) LDPC code transmitted over an AWGN channel with E/,/No = 
1.3 dB 

For a fixed p(x), the problem of code design can then be stated 
as the following linear program 

max ^Jki/i 
subject to: ^^-i = 1 , > , 

Y,hh{iA,i)>i;\iA) v/ a g[o,i). (6) 

Note that maximizing the objective function corresponds to 
maximizing the rate. A similar linear program can be written 
for optimizing p(x) for a given X(x). 

B. Fixed Channel 

We consider the problem of finding LDPC codes for a given 
BMS channel such that reliable communication is possible 
with the MS decoding algorithm. We are interested here in the 
performance when the block length goes to infinity. Our goal 
is to maximize the rate of transmission. Towards achieving 
this goal we first pick an LDPC code such that it converges 
to error free density for the specified channel. 

Starting from the initial ensemble, the LDPC code ensemble 
is optimized in several steps. In each step, the basic idea is to 
design the codes using EXIT charts. However, instead of using 
the Gaussian assumption on the input densities, the input den- 
sity in a particular step of the optimization process is assumed 
to be the same as the density obtained by using the density 
evolution procedure for the ensemble obtained in the previous 
step. The inherent assumption is that the input densities do 
not change much in one step of the optimization procedure 
and therefore the approximate EXIT curves obtained using the 
previous densities are close to the actual EXIT curves. Note 
that this assumption is different from the assumption that the 
density at iteration ; for a particular optimization step is same 
as the density at iteration ; in the next optimization step. If we 
denote the densities at iteration i by a, and consider a family 



of densities that includes {ya,- + (1 — y)a,- + i,y £ [0,1]} then 
the assumption made is that the family of densities does not 
change much in one step of the optimization. We could sample 
many points in this family to enforce the condition in the linear 
program that the EXIT curves do not intersect. However, we 
sample only at points a,. This is usually sufficient since if the 
old EXIT curves are close to each other, then we get many 
samples there and at other points we have more leeway so we 
can sample fewer times. 

In each step of the optimization procedure, we generate a 
new dd pair from the previous dd pair in two sub-steps. In the 
first sub-step we change X(x) keeping p(x) constant and in the 
next sub-step we change p(x) while keeping X(x) the same. 
The first sub-step is as follows. We choose p(x) = p id{x) 
and optimize X(x) as follows. We perform density evolution 
with the dd pair Q<. ld)9old) ar, d at the en d of each iteration 
store lL(d) which is the mutual information corresponding to 
the extrinsic output of a bit node of degree d at the end of 
iteration I. The optimization then reduces to the following 
linear program. 

maxV Xi/i 

2> = U,->o, 

£Vi(0>£wr i (0 
+P£W4(0-4 _1 (0) Pe[o,i)vz, (?) 

-8<Xi-X i dti <8 Vi, (8) 
'Xi < \ tttt (from @). (9) 

Before we explain the constraints, we note that the cost 
function corresponds to maximizing the rate and that the old 
dd pair satisfies the constraints and therefore the resulting rate 
is always larger than the old rate. 

The Constraint © basically represents the condition that the 
EXIT curve corresponding to the bit nodes should lie above 
that of the check nodes. The quantity Y^ald,i{l\,{i) ~ 1^ (0) 
is the gap between the two EXIT curves corresponding to the 
old dd pair. The constant (3 determines how much change in 
the gap is allowed. If (3 is chosen to be the gap between the 
curves can become zero while if (3 is chosen to be one the gap 
is kept the same. 

By choosing a smaller P we weaken the constraints and 
therefore get a larger rate. However, since the dd pair changes, 
the input densities also change and therefore the actual EXIT 
curves change. Since the gap between the approximate EXIT 
curves (one obtained using the previous densities) is smaller 
with smaller p, the chances of the actual EXIT curves in- 
tersecting increases. We choose some value of P, perform 
the density evolution with the new dd pair and check if it 
converges. If it does, we accept the new ensemble and go to 
the second sub-step. If it does not converge, we increase p 
and repeat this sub-step. 

The Constraint ^ is introduced so that the degree distri- 



butions do not change much in an iteration which in turn will 
ensure that the input densities and the resulting EXIT curves 
do not change significantly. 

The Constraint (0 is the stability condition. For the mod- 
ified MS algorithm with a > 1, we replace the stability 
condition by the condition A-2p'(l) < 1. 

In the second sub-step we perform the density evolution 
with the dd pair obtained in the previous sub-step and store 
I l c (d) which is the mutual information corresponding to the 
extrinsic output of a degree d check node at the end of iteration 
/. A linear program, similar to that discussed before, can then 
be used to optimize the rate. As mentioned before, the rate 
keeps increasing with each step of the optimization process. 
We stop the optimization when the increase in rate becomes 
insignificant. 

The linear program discussed above can be easily modified 
for the case when we have a fixed rate and we want to find 
a code with better threshold. This optimization procedure is 
available on-line at [5]. 

V. Optimization Results 

We used the optimization procedure discussed in this paper 
to design LDPC codes for MS. For fixed rate optimization 
scheme the gap to capacity varied significantly depending 
on the average right degree chosen. For the fixed channel 
optimization procedure, the final gap to capacity depended on 
the initial profile with which the optimization procedure was 
started however the variations were observed to be lesser than 
that in fixed rate optimization. 

In Fig. |2] we show the gap to capacity and the average 
right degree corresponding to LDPC codes optimized for MS 
decoding and modified MS decoding with a = 1.25. The 
fixed channel optimization procedure was used to obtain these 
points. We observe that the gap decreases as the rate increases 
but it is still quite far from the Shannon capacity limit. 

Comparison of the threshold of LDPC codes designed for 
BP but used with MS and the threshold of codes designed for 
MS shows that significant gains are obtained by using codes 
specifically designed for MS. For example, the best rate 0.5 
code designed for BP from [4] has a threshold of 1.91 dB 
with MS which is 0.97 dB worse than the best threshold we 
obtained for LDPC codes that were optimized for MS [5]. 

VI. Conclusion 

We derived a sufficient condition for the stability of the fixed 
point 8oo(7) which is also a necessary condition for the density 
evolution to converge to 8oo(7) when initiated with channel log- 
likelihood ratio density. It remains an open question whether 
this condition is also necessary for the stability of fixed point 
8oo(/) subjected to local perturbation. 

We have discussed some properties of densities which 
are observed to be empirically true. We proved that these 
properties remain preserved on the check node side. It remains 
to be seen if the same thing can be proved for the variable 
node side. 
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Fig. 2: Gap to capacity of some optimized profiles 



We presented a simple procedure to optimize LDPC codes 
for MS decoding. To the best of our knowledge, the obtained 
codes are the best codes reported so far for MS decoding and 
they perform significantly better than codes that were designed 
for BP but are decoded using MS. However, their performance 
is quite far from the capacity limit and it remains to be seen if 
the gap is due to the sub-optimality of the design procedure. 
On the other hand if the gap is due to the inherent sub- 
optimality of MS, it will be an interesting research direction 
to explain the gap by information theoretic reasoning. 

Appendix 

Lemma 3: Let a and b be two densities and c = aSb. Then 

B(c) <fi(a)+fi(b). 
Proof: For the sake of simplicity, in the proof we assume 
that densities a and b are absolutely continuous. However the 
proof also works in the general case. Let X and Y be two 
random variables with densities a and b respectively and Z = 
sign(X)sign(y)min(|X|,|y|). Then B(c) = E \e~% , 



B(c) = 



signl.ilsignlv)min(|.f|.|r|) 

- a(x)b(y)dydx, 



5 f 03 min(jc.v) 

/ (a (jc) b(y) + a (-x) b( -y) ) e ^~ dydx+ 
Jo 



JO 



min(jt,y) 



(a(jc)b(— y)+a(— x)b(y))e T^dydx, 



r^4{(a(x)b(y) + a(-x)b(-y)) e -5 
o Jo g{x,y) L 

(a(x)b(-y) + a(-x)b(y))e? } dydx+ 



£^{(a(,)bCy) + a(-*)b(-;y)) e -S 
o Jx g(x,y) I 



+ {a(x)b(-y) + a(-x)b(y))e$\dyd. 
h+h- 



x+ 



(10) 



In (a) we multiply and divide by g(x,y) = (a(x) + 
a(— x))(b(y) + b(— y)). Note that all the densities which arise 
in density evolution satisfy the property that a(x) =0 if and 
only if a(— x) is zero. Thus if a(x) or b(y) are equal to zero 
then the integrand itself is zero and those values of x and y do 
not contribute to the integral. Hence without lose of generality 
we can assume that a(x) and b(y) are not zero. Now, 



2? (a) 



(z(x)e 2 +a(— x)e 2 j dx, 

(b(y) + b(-y)) (a{x)e~i + a(-x)eS ) dydx, 



(a) 

lo Jo 

(b) r f*(a(x) + a(-x)) 



Jo Jo 



(a(x) + a(-x)) 
a(x)e~2 +a(— x)e^ dydx 
(a(x) + a(-x)) 



(b(y) + b(-y)) 



o Jx (a(x) + a(-x)) 
a(x)e~? + a(—x)e^dydx 
hi+hi- 



(b(y) + b(-y)) 



(11) 



In (a) we used the fact that /J° (b(y) +b(—y))dy = 1 and in 
(b) we multiply and divide by (a(x) + a(—x)). Similarly, 



B(b)= r f 
Jo Jo 

(_ 



*(b(y) + b(-y)) 



(b(y) + b(-y)) 
b(y)e~2 + b(-y)e^ aTyt/x 

(b(y) + b(-y)) 



(a(x) + a(-x)) 



(b(y) + b(-y)) 
(b(y)e~2 + b(-y)e^ afydx 

= 4l +/b2- 



(a(x) + a(-x)) 



(12) 



Note that by Eqn([T0] HI EE3, # (c) - B (a) - B (b) = h -hi ~ 
hi +h — hi — hi- We first consider I\ — hi ~hl- We prove 
that the integrand of I\ — hi — hi is pointwise non positive. 
As (a(x) + a(— x))(b(x) + b(— x)) is a common non negative 
factor in the integrands of h,hi an d ^bi, we will not consider 
it. Then the remaining integrand of h — 1 3 \ — I^i is: 



Lemma @] y < x and y is associated with q. The integrand 
of h — hi — hi can also be reduced to EqndTBll in Lemma [4] 
Hence we prove that B(c) <B (a) +B(b). ■ 
We define a Generalized BSC density by, 



x) (z)=p8- x (z) + (l-pMz)- 



gbsc(p,.v) 



Lemma 4: Consider a gbsc ( ; , x ) (z), a g b sc ( ?)> ,) (z) an d c(z) = 

a g bsc(p,x) (z) ^ a gbsc( ? jO (z)- Then 

B (c) — B (a g bsc(;;,x)) — -B (bgbscfey)) < 0- 

Proof: With out loss of generality we can assume that 
3? < x. Then 

c(z) = (p(l-9)+9(l-/>))S_,(z) + ( M +(l-p)(l- g ))8 y (z). 
Now, 

S(c) B (a gbsc ( PiA: )) -B(b gbsc ^ y )) = 
(p(l - 9) +9(1 - + (pq+ (1 - 9))^ 

a- _ x y _ v 

— pe2 — (1 — p)e 3—qe?-(l-q)e 5. 
= p(l -29) ( e 5 _ e -5) _ pe f _ (l-pje-!, 
<0, (15) 



a(x)b(y)e~2 + a (x)b(-y)e2 + a (-x)b(y)e2 + a (-x)b(-y) 



e~2 



(a(x) + a(-x))(b(y) + b(-y)) 
a(x)e _ 5 + a(— x)e2 b(y)e~2 4- b(— y)e2 



Define q 



a(x) + a(— x) 

_ b(-y) n _ a(-.v) 



(13) 



b(y)+H-y)' 



b(y) + b(-y) 
jy. Now we can write Eqn 



O as 

((1 - -9) + (p(l - q)+q{\-p))ei 

_ y v , _ x x 

— (1 — q)e z — qe? — (I — p)e 2 - pe? 



p(l-2q)(e 



2 -pe? - (l-p)e"2. 



(14) 



The Eqn(fT4li is exactly the Eqn(fT5ll in Lemma [4] which is 
proved to be non positive. Also note that as required by 



because 1 — 2g < 1 and y < x, we have p(\ —2q) [e? — < 
pe^ < 0. Thus we have prove the desired statement. 



References 

[1] N. Wiberg, "Codes and Decoding on General Graphs," PhD thesis, 

Linkoping University, Sweden, 1996. 
[2] R. Jotter and P. O. Vontobel, "Graph-covers and iterative decoding of 

finite length codes," in Proc. 3rd International Symposium on Turbo 

Codes, September 2003. 
[3] A. Amraoui, "Asymptotic and finite-length optimization of LDPC 

codes," Ph.d. Thesis, EPFL, June 2006. 
[4] A. Amraoui and R. L. Urbanke, LPDCopt, available at 

http://lthcwww.epfl.ch/research/ldpcopt/ 
[5] K. B hattad an d R. L. Urb anke, LPDCopt for Min-Sum, available at 

http://lthcwww.epfl.cli/researcli/bhattad/ 
[6] K. Bhattad and K. R. Narayanan, "An MSE based transfer chart to 

analyze and design iterative decoding schemes under the Gaussian 

assumption", to appear in IEEE Trans, on Inf. Theory. 
[7] J. Chen and M. Fossorier, "Near optimum universal belief propagation 

based decoding of low-density parity-check codes," IEEE Trans. Com- 

mun., vol. 50, pp. 406414, Mar. 2002. 
[8] S. ten Brink, "Convergence behaviour of iteratively decoded parallel 

concatenated codes," IEEE Trans. Commun., vol. 49, pp. 1727- 1737, 

Oct 2001. 

[9] T. J. Richardson and R. L. Urbanke, "The capacity of low-density parity- 
check codes under message passing decoding," IEEE Trans. Inf. Theory, 
vol. 47, no. 2, pp. 599-618, Feb. 2001. 

[10] T. Richardson, A. Shokrollahi, and R. Urbanke, "Design of capacity 
approaching irregular low-density parity-check codes", IEEE Trans. Inf. 
Theory, vol. 47, no. 2, pp. 619-637, Feb. 2001. 

[11] A. Anastasopoulos, "A comparison between the sum-product and the 
min-sum iterative detection algorithms based on density evolution," in 
Proc. Globecom 2001, San Antonio, TX, Nov. 2001. 

[12] J. Feldman, D. R. Karger, and M. J. Wainwright, "Using linear pro- 
gramming to decode linear codes," in Proc. 37th annual Conference 
on Information Sciences and Systems (CISS '03), Baltimore, MD, Mar. 
12-14 2003. 

[13] P.O. Vontobel and R. Koetter, "On the relationship between linear 
programming decoding and min-sum algorithm decoding," in Proc. 
ISITA 2004, Parma, Italy, pp. 991-996, Oct. 10-13, 2004. 



